PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Glyma.10G212600.1.p
Common NameGLYMA_10G212600, LOC100806246
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Phaseoleae; Glycine; Soja
Family MYB
Protein Properties Length: 1652aa    MW: 181032 Da    PI: 5.9196
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Glyma.10G212600.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding26.12e-08785826346
                          SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
      Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                          +WT+eE e +++ ++ +G++ +++Ia+ +  ++t  +c+++++k
  Glyma.10G212600.1.p 785 PWTPEEREVFLEKFAAFGKD-FRKIASFFD-HKTTADCVEFYYK 826
                          8*****************99.*********.***********98 PP

2Myb_DNA-binding32.81.6e-109731012344
                           SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHH CS
      Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrw 44  
                            WT +E   +++av  +G++ +++Iar++g +R+ +qck ++
  Glyma.10G212600.1.p  973 DWTDDEKTAFLRAVSSFGKD-FAKIARCVG-TRSQEQCKVFF 1012
                           5*****************99.*********.********766 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466893.51E-14769829IPR009057Homeodomain-like
PROSITE profilePS5129315.95781832IPR017884SANT domain
SMARTSM007179.4E-9782830IPR001005SANT/Myb domain
PfamPF002494.3E-6784826IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.602.1E-5785826IPR009057Homeodomain-like
PROSITE profilePS5129312.7239691020IPR017884SANT domain
SMARTSM007172.7E-89701018IPR001005SANT/Myb domain
SuperFamilySSF466891.17E-99711020IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.603.1E-69731012IPR009057Homeodomain-like
PfamPF002499.8E-99731012IPR001005SANT/Myb domain
CDDcd001671.20E-79741012No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1652 aa     Download sequence    Send to blast
MPPEPLPWDR KDFFKERKHE RSESLGSVAR WRDSSHHRDF NRWGSAEFRR PPGHGKQGGW  60
HLFSEEPGHG YAISRSSSDK MLEDDSRPSI SRGDGKYGRS SRENRGGPFG QRDWRGHSWE  120
PNNGSMNFPR RLQDVNNDQR SVDDALAYSS HPHSDFGNAW DQHHLKDQHD KMGGVNMFGT  180
GPRSDRDNSL GDWKPLKWTR SGSLSSRGSG FSHSSSSRSM GGADSHEVKA ELLPKSVAAN  240
ESHSGEAAAC ATSSVPSEDT TSRKKPRLGW GEGLAKYEKK KVEVPDASAN KEGPVLSTSN  300
TEPCNLLSPS LVDKSPKLLG FSECASPATP SSVACSSSPG MDDKLFGKTA NVDNYASNLT  360
GSPAPVSESH FARFSFNLEK FDIDSLNNLG SSIIELVQSD DPTSLDSGPM RSNSINKLLI  420
WKADISKVLE MTESEIDLLE NELKSLKSES GETCPCPCPV TLGSQMVGSD EKSCEEHVGV  480
SDQVIRPVPL KIVDDPNTEK MPLSTNLHSI HENGKEEDID SPGTATSKFV EPLPLIKAVS  540
CDTRGHDNFS RDLDTVLSTA VKCLVPCTTR KEASVPACVD GNISMELKDS MDILYKTIIS  600
SNKESANRAS EVFDKLWPKD CCKIEKMEAS SDACTHTFIM EKFAERKQFA RFKERVIALK  660
FRALHHLWKE DMRLLSIRKC RPKSHKKNEL SVRSTCNGIQ KNRSSIRSRF PFPAGNQLSL  720
VSTSEIINFT SKLLSESQVK VQRNTLKMPA LILDEKEKMI SKFVSSNGLV EDPLAIEKER  780
TMINPWTPEE REVFLEKFAA FGKDFRKIAS FFDHKTTADC VEFYYKNHKS DCFEKIKKQD  840
GDKLGKSYSA KTDLIASGNK KLRAGSSLLG GYGKVKTYRG EDFIEKSSSF DILGDERETA  900
AAADVLAGIC GSLSSEAMSS CITSSVDPVE GNRDRKFLKV NPLCKLPMTP DVTQDVDDET  960
CSDESCGEMD PTDWTDDEKT AFLRAVSSFG KDFAKIARCV GTRSQEQCKV FFSKGRKCLG  1020
LDLMRPIPEN VGSPVNDDAN GGESDTDDAC VVETGSVVET DKSGTKTDED LHLYGTNTYH  1080
DESHPVEARN LSAELNESKE INWTEVDLED ANVTSGACQI NIDSKQGCDG SEVFLCGSNK  1140
SGSVGERADI IMSDSTEVEN DKANKLGGAA TELISAPNTR EPCQSNSIAE DRMVVSEVSS  1200
GGLGNELERH RVSSTLCVDD RDNKHEADSG VIVDMKSSVH DLSTMINSSI SSLGNSCSGL  1260
SFSSENKHVP LGNPRVSALS MDNLHALLQN TVAVDVQCEK TASQDQMSST CDIRGGRDMH  1320
CQNSISNGDH QHITGNLSDH VDAVSILQGY PLQVPVKKEM DSDMNCTSSA TELPLLPQKI  1380
EHDDDHIKAF QSSDSDKTFR NGDVKLFGKI LTNPSTTQKP NVGAKGSEEN GTHHPKLSSK  1440
SSNPKITGHH SADGNLKILK FDHNDYVGLE NVPMRSYGYW DGNRIQTGLS TLPDSAILLA  1500
KYPAAFSNYL TSSAKLEQPS LQTYSKNNER LLNGASTFTT RDINGSNALI DYQMFRRDGP  1560
KVQPFMVDVK HCQDVFSEMQ RRNGFEAISS LQQQSRGMNG VGRPGILVGG SCSGVSDPVA  1620
AIKMHYSNSD KYGGQTGSIA REDESWGGKG D*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4a69_C1e-16743834494NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D1e-16743834494NUCLEAR RECEPTOR COREPRESSOR 2
Search in ModeBase
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Gma.66190.0cotyledon| flower| root
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006589438.10.0PREDICTED: uncharacterized protein LOC100806246 isoform X5
TrEMBLK7LKN00.0K7LKN0_SOYBN; Uncharacterized protein
STRINGGLYMA10G35671.10.0(Glycine max)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF49863352
Representative plantOGRP32151725
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G52250.11e-172MYB family protein